GeoStream: Spatial Information Indexing Within Textual Documents Supported by a Dynamically Parameterized Web Service
Abstract
Cultural heritage content is everywhere on the web: in digital libraries, archives, and the portals of museums and galleries. Cultural heritage document collections are characterized by content related to a territory and its history. In this context, the GeoTopia project, supported by the CNRS-TGE-Adonis, focuses on sharing and interpreting archive data. It consists of a Content Management System (CMS) that manages a repository of multimedia digital documents and exploits information such as origin, theme, period, and area to index and query them. Our contribution is dedicated to the spatial information contained in unstructured textual documents. More specifically, we have developed a process flow that extracts this spatial information, indexes it, and computes precise geolocalized representations. We propose to encapsulate this process flow in a dedicated web service, GeoStream, and to make its behavior dynamically customizable so that it can be integrated more easily into platforms that manage cultural heritage electronic documents.
Similar resources
The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing
The SPIRIT search engine provides a test bed for the development of web search technology that is specialised for access to geographical information. Major components include the user interface, geographical ontology, maintenance and retrieval functions for a test collection of web documents, textual and spatial indexes, relevance ranking and metadata extraction. Here we summarise the functiona...
Normalizing Spatial Information to Improve Geographical Information Indexing and Retrieval in Digital Libraries
Our contribution is dedicated to geographic information contained in unstructured textual documents. The main focus of this article is to propose a general indexing strategy that is dedicated to spatial information, but which could be applied to temporal and thematic information as well. More specifically, we have developed a process flow that indexes the spatial information contained in textua...
Spatio-textual Indexing for Geographical Search on the Web
Many web documents refer to specific geographic localities and many people include geographic context in queries to web search engines. Standard web search engines treat the geographical terms in the same way as other terms. This can result in failure to find relevant documents that refer to the place of interest using alternative related names, such as those of included or nearby places. This ...
Defining a Workflow Process for Textual and Geographic Indexing of Documents
Many public organizations are working on the construction of spatial data infrastructures (SDI) that will enable them to share their geographic information. However, not only geographic data are managed in these SDIs, and, in general, in Geographic Information Systems (GIS), but also many textual documents must be stored and retrieved (such as urban planning permissions and administrative files...
Hybrid Indexing and Seamless Ranking of Spatial and Textual Features of Web Documents
There is significant commercial and research interest in location-based web search engines. Given a number of search keywords and one or more locations that a user is interested in, a location-based web search retrieves and ranks the most textually and spatially relevant web pages. In this type of search, both the spatial and textual information should be indexed. Currently, no efficient index...